The Hash Postfix Table Based Metadata Management Algorithm for Mass Storage System
نویسندگان
چکیده
It is important for mass storage system to distribute access requests dynamically and balanced between several metadata servers (MDS). Based on analyzing the features of metadata management in the mass storage system, this paper designs the structure of metadata management module, proposes the hash postfix table(HPT) based metadata management algorithm, uses HPT to adjust the distribution of access request dynamically, presents the process of metadata querying and HPT quick adjustment. It analyzes the algorithm from the dynamic equilibrium capability and the time and space overhead. At last, it realizes the prototype, using real data sets to evaluating. The results show that the HPT based metadata management algorithm is more effective and flexible, and can avoid the hotspots.
منابع مشابه
FusionProv: Towards a Provenance-Aware Distributed Filesystem
It has become increasingly important to capture and understand the origins and derivation of data (its provenance). A key issue in evaluating the feasibility of data provenance is its performance, overheads, and scalability. In this paper, we explore the feasibility of a management layer for parallel file systems, in which metadata includes both file operations and provenance metadata. We desig...
متن کاملScalable Storage for Data-Intensive Computing
Cloud computing applications require a scalable, elastic and fault tolerant storage system. We survey how storage systems have evolved from the traditional distributed filesystems, peer-to-peer storage systems and how these ideas have been synthesized in current cloud computing storage systems. Then, we describe how metadata management can be improved for a file system built to support large sc...
متن کاملResearch of Data Storage and Querying Methods Based on Ring Distribut- ed Hash
In this paper, the main contributions of this work include three aspects. First, the deployment on different datacenters of Impala which is a database based on Ring Distributed Hash. This thesis deploys Impala system on different datacenters across WAN or across regions. Second, the research of data storage and search method based on circular distributed hash. This thesis adopts distributed has...
متن کاملDDSF: A Data Deduplication System Framework for Cloud Environments
Cloud storage has been widely used because it can provide seemingly unlimited storage space and flexible access way, while the rising cost of storage and communications is an issue. In this paper, we propose a Data Deduplication System Framework(DDSF) for cloud storage environments. The DDSF consists of three major components, the client, fingerprint server and storage component. The client com...
متن کاملEnabling High Data Throughput in Desktop Grids through Decentralized Data and Metadata Management: The BlobSeer Approach
Whereas traditional Desktop Grids rely on centralized servers for data management, some recent progress has been made to enable distributed, large input data, using to peer-to-peer (P2P) protocols and Content Distribution Networks (CDN). We make a step further and propose a generic, yet efficient data storage which enables the use of Desktop Grids for applications with high output data requirem...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JDCTA
دوره 4 شماره
صفحات -
تاریخ انتشار 2010